Privacy-preservation for Stochastic Gradient Descent Method

نویسندگان

  • Shuang Wu
  • Jun Sakuma
چکیده

The traditional paradigm in machine learning has been that given a data set, the goal is to learn a target function or decision model (such as a classifier) from it. Many techniques in data mining and machine learning follow a gradient descent paradigm in the iterative process of discovering this target function or decision model. For instance, Linear regression can be resolved through a gradient descent method that iteratively minimizes the error of the target function. Other gradientdescent-based methods include Bayesian networks induction, genetic algorithms, and simulated annealing. These traditional algorithms assume free access to data, either at a centralized location or in federated form. Increasingly, privacy and security concerns restrict this access. Nowadays, data is often distributed among multiple parties: financial data is distributed across multiple banks and credit agencies; medical records are distributed across multiple hospitals and health care institutions and so on. secure multiparty computation and privacy preservation have attracted much attention in incorporating security into machine learning and data mining algorithms. A key issue in multiparty secure methods is to allow individual parties to preserve the privacy of its data, while contributing to the computation of a global result together with other parties. Many methods have been proposed to perform secure multiparty computation. For instance, the scalar product is a basic operations required in data mining. Recently, Li considered a privacy-preserving method [7] for Batch Gradient Descent (BGD) where each party learning a local classifier on its own data, and then gave two options on how to use the computed classifiers: all parties jointly compute the final classifier within a privacypreserving way; another is using their local classifier to jointly perform on-demand prediction whenever an unknown sample’s target value is required. The solution [7] proposed by Li is attractive, however, Batch Gradient Descent (BGD) method is costly for large scale data set problem and some randomized parameters in their method have not been declared in details. Another work [1] is a differentially private Stochastic Gradient Descent (SGD) algorithm for multiparty setting proposed by Rajkumar. Here, we propose a privacy-preserving solution based on Stochastic Gradient Descent (SGD) procedure by using the paillier cryptosystem. Our proposal achieves more strict privacy than [7], in our protocol, each party knows nothing but their own data during the update process. And it is a different solution by using cryptography to ensure privacy rather than random disruption in [1].

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Two Stage Prediction Process with Gradient Descent Methods Aligning with the Data Privacy Preservation

Privacy preservation emphasize on authorization of data, which signifies that data should be accessed only by authorized users. Ensuring the privacy of data is considered as one of the challenging task in data management. The generalization of data with varying concept hierarchies seems to be interesting solution. This paper proposes two stage prediction processes on privacy preserved data. The...

متن کامل

Stochastic Gradient Descent for Relational Logistic Regression via Partial Network Crawls

Research in statistical relational learning has produced a number of methods for learning relational models from largescale network data. While these methods have been successfully applied in various domains, they have been developed under the unrealistic assumption of full data access. In practice, however, the data are often collected by crawling the network, due to proprietary access, limite...

متن کامل

Identification of Multiple Input-multiple Output Non-linear System Cement Rotary Kiln using Stochastic Gradient-based Rough-neural Network

Because of the existing interactions among the variables of a multiple input-multiple output (MIMO) nonlinear system, its identification is a difficult task, particularly in the presence of uncertainties. Cement rotary kiln (CRK) is a MIMO nonlinear system in the cement factory with a complicated mechanism and uncertain disturbances. The identification of CRK is very important for different pur...

متن کامل

Robust Decentralized Differentially Private Stochastic Gradient Descent

Stochastic gradient descent (SGD) is one of the most applied machine learning algorithms in unreliable large-scale decentralized environments. In this type of environment data privacy is a fundamental concern. The most popular way to investigate this topic is based on the framework of differential privacy. However, many important implementation details and the performance of differentially priv...

متن کامل

A Differentially Private Stochastic Gradient Descent Algorithm for Multiparty Classification

We consider the problem of developing privacypreserving machine learning algorithms in a distributed multiparty setting. Here different parties own different parts of a data set, and the goal is to learn a classifier from the entire data set without any party revealing any information about the individual data points it owns. Pathak et al [7] recently proposed a solution to this problem in whic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012